An Algorithm for Determining Talker Location using a Linear Microphone Array and Optimal Hyperbolic Fit

Author

  • Harvey F. Silverman
Abstract

One of the problems for all speech input is the necessity for the talker to be encumbered by a head-mounted, hand-held, or fixed-position microphone. An intelligent, electronically aimed unidirectional microphone would overcome this problem. Array techniques hold the best promise to bring such a system to practicality. The development of a robust algorithm to determine the location of a talker is a fundamental issue for a microphone-array system. Here, a two-step talker-location algorithm is introduced. Step 1 is a rather conventional filtered cross-correlation method; the cross-correlation between some pair of microphones is determined to high accuracy using a somewhat novel, fast interpolation on the sampled data. Then, using the fact that the delays for a point source should fit a hyperbola, a best hyperbolic fit is obtained using nonlinear optimization. A method which fits the hyperbola directly to peak-picked delays is shown to be far less robust than an algorithm which fits the hyperbola in the cross-correlation space. An efficient, global nonlinear optimization technique, Stochastic Region Contraction (SRC), is shown to yield highly accurate (>90%) and computationally efficient results for a normal ambient.
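
The two steps described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the paper's implementation: the function names, the 343 m/s speed of sound, and the microphones placed on the line y = 0 are hypothetical. Parabolic peak interpolation stands in for the paper's own fast interpolation, and a brute-force grid search stands in for SRC. Note also that this sketch fits the hyperbola to peak-picked delays, which the abstract reports as the less robust of the two variants; it is used only because it is the shortest to show.

```python
import numpy as np

C = 343.0  # assumed speed of sound in m/s


def model_delays(src, mic_x, ref=0):
    """Delay (s) at each microphone relative to microphone `ref`.

    Microphones sit on the line y = 0 at positions mic_x; src = (x, y).
    Distance versus microphone position is a hyperbola, which is the
    shape the delays of a point source should follow.
    """
    d = np.hypot(np.asarray(mic_x) - src[0], src[1])
    return (d - d[ref]) / C


def estimate_delay(a, b, fs):
    """Delay of b relative to a (s): cross-correlation peak, refined by
    fitting a parabola through the peak and its two neighbours."""
    n = len(a)
    xc = np.correlate(b, a, mode="full")  # index i is lag i - (n - 1)
    k = int(np.argmax(xc))
    frac = 0.0
    if 0 < k < len(xc) - 1:
        y0, y1, y2 = xc[k - 1], xc[k], xc[k + 1]
        denom = y0 - 2.0 * y1 + y2
        if denom != 0.0:
            frac = 0.5 * (y0 - y2) / denom  # sub-sample offset of the vertex
    return (k - (n - 1) + frac) / fs


def fit_source_grid(delays, mic_x, xs, ys, ref=0):
    """Least-squares hyperbolic fit by exhaustive search over candidate
    (x, y) positions. A stand-in for a global optimizer such as SRC."""
    best, best_cost = None, np.inf
    for x in xs:
        for y in ys:
            r = model_delays((x, y), mic_x, ref) - delays
            cost = float(r @ r)
            if cost < best_cost:
                best, best_cost = (float(x), float(y)), cost
    return best
```

In use, `estimate_delay` would be applied to each microphone signal against the reference microphone, and the resulting delay vector passed to `fit_source_grid`. A fit in cross-correlation space would instead score each candidate position by the cross-correlation values at the lags that `model_delays` predicts, avoiding the hard peak-picking decision.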

Similar resources

Practical Issues in the Use of a Frequency-domain Delay Estimator for Microphone-array Applications

A frequency-domain delay estimator has been used as the basis of a microphone-array talker location and beamforming system [Brandstein, M.S., and Silverman, H.F., Technical Report LEMS116, (1993)]. While the estimator has advantages over previously employed correlation-based delay estimation methods [Silverman, H.F. and Kirtman, S.E., Computer Speech and Language, 6, 129-152 (1990)], including ...

Hands-free speech recognition based on 3-D Viterbi search using a microphone array

A microphone array is a promising solution for realizing hands-free speech recognition in real environments. Accurate talker localization is very important for speech recognition using the microphone array. However, localization of a moving talker is difficult in noisy reverberant environments. The talker localization errors degrade the performance of speech recognition. To solve the problem, th...

A Thinning Method of Linear And Planar Array Antennas To Reduce SLL of Radiation Pattern By GWO And ICA Algorithms

In recent years, optimization techniques using evolutionary algorithms have been widely used to solve electromagnetic problems. These algorithms thin the antenna arrays with the aim of reducing the complexity and thus achieving the optimal solution and decreasing the side lobe level. To obtain the optimal solution, thinning is performed by removing some elements in an array thro...

A Microphone-Array System for Speech Recognition Input

Recent Accomplishments: A new nonlinear optimization algorithm called Stochastic Region Contraction (SRC) has been developed and has been applied to the microphone placement problem, talker location, and talker characterization. We have found that SRC is nearly two orders of magnitude faster than simulated annealing. Our current research array system has been "hardened", and real-time, ti...

A closed-form method for finding source locations from microphone-array time-delay estimates

The linear intersection (LI) estimator, a closed-form method for the localization of source positions given only the sensor array time-delay estimate information, is presented. The array is constrained to be composed of 4-element sub-arrays configured in 2 centered orthogonal pairs. A bearing line in 3-space is estimated from each sub-array and potential source locations are found vi...

Publication date: 1990